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Abstract 

We investigate the computational power of periodically iterated mor- 
phisms, also known as DOL systems with periodic control, PDOL systems 
for short. These systems give rise to a class of one-sided infinite sequences, 
called PDOL words. 

We construct a PDOL word with exponential subword complexity, 
thereby answering a question raised by Lepisto [22] on the existence of 
such words. We solve another open problem concerning the decidability 
of the first-order theories of PDOL words [23] ; we show it is already un- 
decidable whether a certain letter occurs in a PDOL word. This stands 
in sharp contrast to the situation for DOL words (purely morphic words), 
which are known to have at most quadratic subword complexity, and for 
which the monadic theory is decidable. 

The main result of our paper, leading to these answers, is that every 
computable word w £ E u can be embedded in a PDOL word ueT" with 
r D £ in the following two ways: (i) such that every finite prefix of to is a 
subword of u, and (ii) such that w is obtained from u by erasing all letters 
from F \ E. The PDOL system generating such a word u is constructed 
by encoding a Fractran program that computes the word w; Fractran is 
a programming language as powerful as Turing Machines. 

As a consequence of (ii), if we allow the application of finite state 
transducers to PDOL words, we obtain the set of all computable words. 
Thus the set of PDOL words is not closed under finite state transduction, 
whereas the set of DOL words is. It moreover follows that equality of PDOL 
words (given by their PDOL system) is undecidable. Finally, we show 
that if erasing morphisms are admitted, then the question of productivity 
becomes undecidable, that is, the question whether a given PDOL system 
defines an infinite word. 
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Introduction 



Morphisms for transforming and generating infinite words provide a fundamen- 
tal tool for formal languages, and have been studied extensively; we refer to [2 
and the bibliography therein. 

In this paper we investigate the class of infinite words generated by period- 
ically alternating morphisms jTQl [TTJ [2H E] ■ Instead of repeatedly applying a 
single morphism, one alternates several morphisms from a given (finite) set in a 
periodic fashion. Let us look at an example right away, and consider the most 
famous word generated by such a procedure, namely the Kolakoski word [21] 

K = 1 22 11 2 1 22 1 22 11 2 11 22 1 2 11 2 1 22 11 2 • • • 

which is defined such that K(0) = 1 and K(n) equals the length of the n-th 
run of K; here by a 'run' we mean a block of consecutive identical symbols. 
The Kolakoski word can be generated by alternating two morphisms on the 
starting word 12, ho for the even positions and h\ for the odd positions, defined 
as follows: 

h °- 2 -Ml hi: 2^22 
The first few iterations then are 

12 

h (l)hi(2) = 122 
ho(l) hi(2) ho(2) = 12211 
h (l) hi{2) fr„ (2) fri(l) fto(l) = 1221121 

It is known that the Kolakoski word is not purely morphic [TT], i.e, cannot 
be generated by iterating a single morphism. However it is an open problem 
whether it is a morphic word, i.e., the image of a purely morphic word under 
a coding (= letter-to- letter morphism). We shall use the 'DOL' terminology: 
DOL for purely morphic, CDOL for morphic, and PDOL for words generated by 
periodically alternating morphisms, like the Kolakoski word above. 

A natural characteristic of sequences is their subword complexity [HI HJ [2] • 
The subword complexity of a sequence u is a function Pl-)fj mapping n to the 
number of n- length words that occur in it. It is well-known that morphic words 
have at most quadratic subword complexity |12) . Lepisto [2 2) proves that for all 
r£K there is a PDOL word whose subword complexity is in i7(n r ); hence there 
are PDOL words that are not CDOL. It remained an open problem whether PDOL 
words can exhibit exponential subword complexity. This intriguing question 
formed the initial motivation for our investigations. We actually establish a 
stronger result from which the existence of such words can be derived, as we 
will describe next. 

The main results of our paper can be stated as follows: For every computable 
word w € there exists a PDOL word u such that 

I. all prefixes of w occur in u as subwords between special marker symbols 
( Theorem |^.ff[ ), 
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77. w is the subsequence of u obtained from selecting all letters from £ (The- 
orem \4-12\ ). 



The construction of the PDOL systems generating such words u makes use of 
Fractran [7J [5] , a Turing complete programming language invented by Conway, 
in the following way. First, in Section [3J we show how to employ Fractran to 
generate any computable infinite word. Then we encode Fractran programs as 
PDOL systems, and prove that the PDOL system correctly simulates the Fractran 
program and records its output, see Sections [3] and HJ 
Consequences of I and II are as follows: 

(1) There exist PDOL words with exponential subword complexity (Theorem 

EH. 

(2) It is undecidable to determine, given a PDOL system % and a letter 6, 
whether the letter b occurs (infinitely often) in the word generated by H 
(Theorem rog). 

(3) The first-order theory of PDOL words is undecidable (Corollary 14. 16|) . 

(4) Equality of PDOL words is undecidable fCorollarv l4.17|) . 

(5) The set of PDOL words is not closed under finite state transductions (Corol- 
lary HJ3]). 

All the above results concern PDOL systems whose morphisms are non-erasing. 
But we also study erasing PDOL systems, and find 

(6) It is undecidable to determine, on the input of an erasing PDOL system, 
whether it generates an infinite word (Theorem 13. 6p . 

The outline of the paper is as follows. In Sections [1] and [2] we introduce 
the dramatis personae of our story: PDOL systems and Fractran programs. We 
explain the workings of the Fractran algorithm, and how to program in this 
language. 

Then, as a steppingstone to our main result, we start with a proof of (6) in 
Section [3j This proof illustrates our key construction: encoding Fractran pro- 
grams as PDOL systems. We then modify and extend this encoding in Section[4] 
to prove Theorems 14.91 and 14.121 PDOL words can embed every computable 
word, in the sense of I and II above. We give a detailed example of the trans- 
lation, and prove (l)-(5) listed above. 

PDOL systems resulting from encoding Fractran programs can be quite large. 
For example, the system obtained from a simple binary counter (computing an 
infinite word with exponential subword complexity) consists of 

536393214598471230 

morphisms. We present a direct solution in Section [5J namely a PDOL system 
with 16 morphisms simulating such a counter. 
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1 DOL Systems with Periodic Control 



We use standard terminology and notations, see, e.g., [2]. Let E be a finite 
alphabet. We denote by E* the set of all finite words over E, by e the empty 
word, and by E + = E* \ {e} the set of finite non-empty words. 

The set of infinite words over E is E" = {x | x : N — > E}. On the set of all 
words E°° = E* UE" we define the metric d for all u, v £ E°° by d(u, v) = 2~™, 
where n is the length of the longest common prefix of u and v. 

We let E p = {0, . . . ,p - 1}. We write \x\ for the length of x £ E°°, with 

= oo if x is infinite. We call a word v £ E* a factor of x £ E°° if x = uvy 
for some u £ E* and y £ E°°, and say that v occurs at position \u\. For words 
u,v £ E*, we write u -< v if u is a strict prefix of v, i.e., if v = uu' for some 
v! £ E + , and use -< for its reflexive closure. 

A morphism is a map h : E* — > T* such that h(uv) — h{u)h{v) for all 
u, v £ E*, and can thus be defined by giving its values on the symbols of E. A 
morphism h is called erasing if h(a) = e for some a £ E, and k-uniform, with 
k £ N, if |/i(a)| = k for all a £ E; h is a coding if it is 1-uniform. 

Infinite sequences generated by periodically alternating morphisms, also 
called 'DOL words with periodic control' or just 'PDOL words' for short, were 
introduced in [TO] . These form a generalization of DOL words, also known as 
purely morphic words, which are obtained by iterating a single morphism. 

Definition 1.1. Let H = (ho, . . . , h v -\) be a tuple of morphisms hi : E* — > E*. 
We define the map H : E* — > E* as follows: 

H(a ai ■ ■ ■ a n ) = it ui ■••u n 

where Ui = hkip-i), with fc = i (mod p) and k £ E p . 

If s £ E* is such that s ^ ff(s), then the triple H = (E, iJ, s) is called a PDOL 
system. Then in the metric space (S 00 , d) the limit 

H u (s) = lim iP(s) 

exists, and we call H u (s) the PDOL word generated by H. We say that % is 
productive if H u (s) is infinite, and "H is erasing if some of its morphisms hi are 
erasing. 

If x is a PDOL word generated by p morphisms, and x — uvy for some 
u, v £ E* and y £ E°°, we say that the factor v of x occurs at morphism index i 
when i £ E p and i = |u| (mod p). 

DOL words are generated by DOL systems (E, h, s), i.e., PDOL systems 
(E, (h),s) consisting of one single morphism h. Following [6], we call the im- 
age of a DOL word under a coding (a letter-to- letter morphism) , a CDOL word, 
better known as morphic words. 

In the literature, one typically requires the morphisms hi to be non-erasing 
to ensure that the limit is infinite. We have taken a more general definition 
of PDOL-words, since also erasing morphisms may yield an infinite word in the 
limit. See Remark ll.3l below. 
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In the sequel it will be helpful to have a recursive definition of the map H. 

Lemma 1.2. Let H = (ho, hi, . . . , hp— 1) be a tuple of morphisms. For i e E p 
define Hi = (hi, . . . , h p -i, ho, . . . , hi— i) and the corresponding map Hi : E* — > 

H l {e)=e 

Hi(au) = hi(a)Hi+i(u) (a € T,, u € E* ) 

where addition in the subscript of H is taken modulo p. 

Then Hq = H with H the map defined in Definition \1.1[ and Hi(uv) = 
Hi(u)H i+ \ u \(v) for all u,V G E* and i G E p . □ 

Using this notation we now formulate the PDOL analogue of the usual con- 
dition for productivity of DOL systems. In Section [3] we show that productivity 
of PDOL systems in general is undecidable. Productivity has been studied in 
the wider perspective of term rewriting systems in [25| 1131 115[ 116] . 

Remark 1.3. Let (E, h, s) be a DOL system. We say that h is prolongable on s 
if h(s) = sx for some x G E* and h^x) ^ e for all i > 0. Then /i*(s) -< h i+1 (s) 
for all i > 0, and hence the limit h u '(s) = s x h(x) h 2 (x) ■ ■ ■ is infinite. The 
generalization of this condition to PDOL systems "H = (E, H, vq) is: (*) H(vq) — 
VqV\ for some i>i G E* such that v n ^ e for all neN, where v n G E* and z n G Ej, 
are defined by zq = and 

v n = H Zn _ t (v n -i) (n>2) 
2 n = Zn-i + (mod p) (n > 1) 

Then H n (vo) = i/' l_1 (?Jo)ti„ for all n > 1, and so (*) forms a necessary and suf- 
ficient condition for productivity of H, that is, for the limit H u (va) — v v\V2 ■ • ■ 
to be infinite. 

Definition 1.4. The subword complexity of an infinite word x G E w is the 
function p x : N — > N such that p x (n) is the number of factors (subwords) of x 
of length n. 

Proposition 1.5 f[12]L The subword complexity of DOL words, and hence of 
CDOL words, is at most quadratic. 

We first consider an example of an erasing PDOL system. 

Example 1.6. Let U = (E 3 , H, 0) with H = (ho, hi, ha) defined for all b G E 3 
as follows, where addition runs modulo 3: 

h (b) = b(b + l)(b + 2) hi(b)=e h 2 (b) = b + 2 

Then % is productive (by Proposition 1 1.8[) and generates the word 

H^iO) = 0121120101221201120212010120201001210 ■ • • 



5 



Definition 1.7. Let TL = (E, (ho, . . . , s) be a PDOL system. We say 1-L 

is locally uniform if every morphism /i^ is uniform, i.e, if for all i 6 E p there is 
fc.; e N such that ki = for all fceS. We say 'H is (globally) uniform if, 

for some fcgN, each /ij is fc- uniform (i € E p ). 

Obviously, a globally fc-uniform PDOL system is productive if and only if 
fc > 2. For locally uniform systems the condition is formulated as follows, and 
is easy to check. 

Proposition 1.8. Let H = (E, (Jio, . . . , h p -±), w) be a locally uniform PDOL 
system, where hi is ki-uniform. Let s(n) be defined by s(0) = and s(n + 1) = 
s(n) + ki with i = n (mod p). Then W is productive if and only if s(n) > n for 
all ri > \w\ . 

Proof. The word (w) can be defined as the limit of the sequence 

W\ w \,W\ w \ + 1 ,W\ w \ +2 , ■ ■ ■ 

of finite words defined for n > \w\ by 

w\ w \ = H(w) 

\ w n if n > \w n \ 

1 w n hi(w n (n)) if n < \w n \ and n = i (mod p) 

We have \wi w i \ = s(\w\) and by induction we get \w n \ — s(n) for every n > \w\. 
The limit lmin^oo w n is infinite if and only if we never get to the clause n > \w n \, 
which holds in turn if and only if s(n) > n for all n > □ 

Example 1.9. Let S = {0,1,2}, H = (h Q ,hi) with morphisms h Q , hi : S* — > 
E* for all a e E defined by 

h a (a) = a(a + l)(a + 2) h x (a) = {a + 2)(a + l)a 

(with addition modulo 3). Then the PDOL system (E, H, 0} generates the word 

H u (0) = 012021201210201021201210120102120 • ■ • 

This is the square- free Arshon word [3] (of rank 3), which Berstel proved to be 
an example of a CD0L word that is not a D0L word [1] ; see Seebold [53] for a 
generalization. That iJ"(L) can indeed be defined as a CD0L word follows from 
Proposition 11.101 

It is not hard to see that, when a word u is generated by a (globally) 
fc-uniform PDOL system, it is fc-automatic [2], i.e., u is the image of a cod- 
ing of the fixed point of a fc-uniform morphism. 

Proposition 1.10. Let fc > 2, and "H — (E,iJ, s) a k-uniform PDOL system. 
Then H u (s) is k-automatic. 
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Proof. Let H = (ho, . . . , h p ~i), where every hi is /c-uniform. We define the 
(fc-uniform) morphism g : E p x S — > S p x £ by 



where addition in the first entries runs modulo p, and for j £ bj € £ is such 
that ft,- (a) = 6 6i • ■ - h-x- Let s = s si • • -s g _i, t = (0, s )(l, si) ■ • • (q— l,s g _i), 
and m = iJ"(s). Then 



follows by induction on n. Hence T(g u (t)) = u with r the coding defined by 



One might wonder whether also locally uniform, productive PDOL systems 
always generate morphic words. Examples 11.111 and I f . f 21 show that this is not 
the case. 

Example 1.11 ([22]). Define the word F p £ {0, 1}" for every p > 2 by F p = 
i? w (0) where ({0, 1},H,Q) with H = (h , h p -i) is a PDOL system, and hi 
are morphisms defined by 



For example, the word F3 starts like this: 

010100110001011001000110011100010100001101010011 • ■ ■ 

Lepisto [22] proves that F p has more than quadratic subword complexity, for 
every p > 2. Hence, with Proposition 11.51 these PDOL words F p cannot be 
CD0L words. We note that, conversely, the existence of CD0L words that are 
not PDOL words was shown in |10) . 

Example 1.12 ( 6 ). A Toeplitz word [20] over an alphabet S is generated by 
a seed word u e S(S U {?})* with ? ^ E, as follows. Start with the periodic 
u u and then replace its subsequence of ?'s by the sequence itself. For example 
u = 12??? generates the infinite word T(u) = 121211221112221 • • •. Cassaigne 
and Karhumaki [6] show that all Toeplitz words are PDOL words; e.g., T(u) = 
-H^(l) where H = (ho, hi, /12) and ho(a) — 12a and h\(a) = /12(a) = a for all 
a E {1,2}. Moreover, from [H Theorem 5] it follows that PT(u)( n ) G <d(n r ) 
with r — log 1 5^ 1 o g3 — 3.15066, thus forming an alternative proof of what was 
established in [22]: there are PDOL words that are not CD0L. 

2 Fractran for Computing Streams 

Fractran [7] [8] is a universal programming language invented by John Hor- 
ton Conway. The simplicity of its execution algorithm, based on the unique 



g((h a )) = {ki,b )(ki + ••• (ki + k- lA-i) 



g n (t) = (0, u(0)> (1, «(!)> • • • (qk n - 1, u(qk n - 1)) 



r({i,a)) 



□ 
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prime factorization of integers, makes Fractran ideal for coding it into other 
formalisms. 

A Fractran program F is a finite list of fractions 

F = f I <» 

with rii,di positive integers. Let fi — ^j-- The action of F on an input inte- 
ger TV > 1 is to multiply TV by the first 'applicable' fraction that is, the 
fraction /; with i the least index such that the product TV' = N ■ fi is an integer 
again, and then to continue with TV'. The program halts if there is no applicable 
fraction for the current integer TV. 
For example, consider the program 

F=^- i i 
2-3' 2' 3 

and the run of F on input TV = 2 3 3 5 : 

2 3 3 5 -)• 2 2 3 4 5 1 ->■ 2 1 3 3 5 2 2°3 2 5 3 -)• 2°3 1 5 3 -> 2°3°5 3 . 

Note that each multiplication by | decrements the exponents of 2 and 3 while 
incrementing the exponent of 5. Once I is no longer applicable, i.e., when one 
of the exponents of 2 and 3 in the prime factorization of the current integer TV 
equals 0, the other is set to as well. Hence, executing F on TV = 2 a 3 b halts 
after max(a,6) steps with 5^(0,6) _ 

Thus the prime numbers that occur as factors in the numerators and denom- 
inators of a Fractran program can be regarded as registers, and if the current 
working integer is TV = 2 a 3 b 5 C ... we can say that register 2 holds a, register 3 
holds b, and so on. 

The real power of Fractran, however, comes from the use of prime exponents 
as states. To explain this, we temporarily let programs consist of multiple lines 
of the form 

ni n 2 n m 

a: > 011, > Oi2, > a m (2) 

ax a,2 a m 

forming the instructions for the program in state a: multiply TV with the first 
applicable fraction 5f and proceed in state «j, or terminate if no fraction is 
applicable. We call the states a\, . . . , a m in ([2]) the successors of a, and we say 
a state is looping if it is its own successor. 

For example, the program P a dd given by the lines 

a : > a, > p and p : > p 

3 1 5 

realizes addition; running in state a on TV = 2 a 3 b terminates in state (3 
with 2 Q+b 3 b . 

A program with n lines is called a Fractran-n program. A flat list of fractions 
fi,...,fk now is a shorthand for the Fractran- 1 program a : f\ — >■ a, f% — > 
a, . . . , fk — > a. Conway 8 explains how every Fractran-rt program (n > 2) can 
be compiled into a Fractran-1 program, using the following steps: 
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(i) For every looping state a, introduce a 'mirror' state », substitute « for all 
occurrences of a in the right-hand sides of its program line, and add the 
line 

1 

» : — — >• a 

(ii) Replace state identifiers a by 'fresh' prime numbers. 

(iii) For every line of the form ^ append the following fractions: 

nx ■ Qi n 2 ■ «2 nk ■ a m 

d\ ■ a ' g?2 • ol ' d m ■ a 

(preserving the order) to the list of fractions constructed so far. 



Let us illustrate these steps on the adder P a dd given above. Step (i) of splitting 
loops, results in 

2 ' 5 1 a a 3 ft 

6 1 5 

In step |(ii) | we introduce 'fresh' primes to serve as state indicators, for example, 



(a, 8j = (7,11,13,17). Finally, step (iii) we replace lines by fractions, to 
obtain the Fractran-1 program 

_2-5-» a /3 3-?\ P 
6 ■ a £> a 5 • p q 

Then indeed the run of F add on 2 a 3 b a ends in 2 Q+b 3 b /3. 

For 'sensible' programs any state indicator has value ('off') or 1 ('on'), and 
the program is always in exactly one state at a time. Hence, if a program F 
uses primes r\,...,r p for storage, and primes a%,...,a q for control, at any 
instant the entire configuration of F (= register contents + state) is uniquely 
represented by the current working integer N 

N = rl 1 r e 2 2 ■■■ a 3 

for some integers e\, . . . , e p > and 1 < j < q. 

The reason to employ two state indicators a and o to break self-loops in 
step |(i)| is that each state indicator is consumed whenever it is tested, and so 
we need a secondary indicator o to say "continue in the current state" . This 
secondary indicator o is swapped back to the primary indicator a in the next 
instruction, and the loop continues. 

Wc now introduce some further notation. For partial functions g : A — B 
we write g{x)\. to indicate that g is defined on x £ A, and g{x)'\ otherwise. 
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Definition 2.1. Let F = fi, . . . , fk be a Fractran program with = 5f G Q>o- 
We define the partial function : N — 1 N which, given an integer N > 1, 
selects the index of the first fraction applicable to AT, and is undefined if no 
such fraction exists, i.e., 

^f(N) = min {i | 1 < i < k, N ■ G N} , 

where we stipulate (min 0)f- We write tp(n) for short when _F is clear from the 
context. 

We overload notation and use F : N — N to denote the one-step computation 
of the program F, defined for all N > 1 by 

F(N) = N ■ f^ N) 

where it is to be understood that F(N)"[ whenever ip(N)^. The run of F on N 
is the finite or infinite sequence N, F(N), F 2 (N), .... We say that F halts or 
terminates on N if the run of F on JV is finite. 

The halting problem for Fractran programs is undecidable. 

Proposition 2.2 ( [141 Theorem 2.2]). The uniform halting problem for Fractran 
programs, that is, deciding whether a program halts for every starting integer 
N > 0, is II!]- complete. 

Proposition 2.3 ( |19l Theorem 68]). The input-2 halting problem for Fractran 
programs, that is, deciding whether a program halts for the starting integer N = 
2, is Ej 1 - complete. 

Remark 2.4. In some sense it does not matter which prime numbers are used 
in a Fractran program. Let us make this precise. Let p be a prime number, 
and n a positive integer. Then let v p (n) denote the p-adic valuation of n i.e., 
v p (n) — a with a e N maximal such that p a divides n. For p = (pi,p2j • • • ,Pt) we 
write vp(n) to denote (v pi (n) , v P2 (n) , . . . , v Pt (n)). Let F be a Fractran program 
with t distinct primes p = p\ , P2 , ■ ■ ■ , Pt , let q = q%, q?, ■ ■ ■ , qt be any vector 
of t distinct primes, and let G be the program obtained from F by uniformly 
substituting the q^s for the p^s. Then clearly, for all integers M,N > such 
that vp(M) = Vf(N), we have v^{F l {M)) = v^G^N)) for all i > 0. 

We employ Fractran programs to define finite or infinite words over the 
alphabet {0,1} by giving the primes 3 and 5 a special meaning, namely for 
indicating output and 1, respectively. The construction easily generalizes to 
arbitrary finite alphabets. 

Definition 2.5. Let F be a Fractran program. The finite or infinite word Wp 
computed by F is Wf = W(2) where W(N) = e if the sequence F(N), F 2 (N), . . . 
does not contain values divisible by 3 or 5, (note that this includes W(N) = e 
if F(iV)t), and otherwise 

U)W{F{N)) if3|F(A0, 
W{N) = I 1 W(F(N)) if 5 | F(N) and 3 \ F(N), 
[W(F(N)) otherwise. 
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So the word Wf is infinite if and only if F does not terminate on input 2 
and the run of F on N contains infinitely many numbers that are divisible by 
3 or 5. The infinite word can be read off from the infinite run by dropping all 
entries neither divisible by 3 nor 5, and then mapping the remaining entries to 
or 1, if they are divisible by 3 or 5 (and not 3), respectively. 

Example 2.6. The Fractran program |, |, | gives rise to the computation 
3, 5, 3, 5, 3,5,.. ., and hence computes the infinite word 010101 ... of alternating 
bits. 

Proposition 2.7. Every (finite or infinite) computable, binary word can be 
computed by a Fractran program. 

Proof. In |14) it is shown that Fractran programs can simulate any Turing ma- 
chine computation. By Remark 12.41 we may assume that this translation does 
not employ the primes {2, 3, 5}. Then a straightforward adaptation of the proof 
in |14j yields the claim: we multiply the fractions corresponding to the Turing 
machine generating an output or 1 by the primes 3 or 5, respectively, and 
make sure the thus introduced factor 3 or 5 is removed in the next step by 
putting fractions ^ and ^ in front of the program. □ 

We define a Fractran- n program and compile it to a Fractran- 1 program -Fbin 
which computes an infinite word that has every finite binary word as one of its 
factors. For this we use the bijective 'z-representation' defined as follows. 

Definition 2.8. Let £ = {0, 1}. For all n G N and w £ £*, we define (n) z G £* 
and [w] z e N by 

(0) z - e [e] z = 

(2n + l) z = 0(n) z [0w] z = 2[w} z + 1 

(2n + 2) z = l(n) z [lw] z = 2[w] z + 2 

and we let BIN denote the infinite word 

BIN = (0)z(l)z(2) z ••• = 1 00 10 01 11 000 100 010 •• • 

We will now define a Fractran program that computes BIN; it will be the 
compilation of the following Fractran-7 program: 



Oil ■ 


T2 

> 

T3 


&5 , 


n 

— > OL2 








a 2 ■ 


r-irz 
T\ 




1 






Pi 


a3 : 


r\ 

> 

r3 


, 


1 








0:4 : 


rs 

— -> 


(34 , 


- ->A>, 

T2 


i 


■> Pi 




a 5 : 


r3 


«5 , 


1 









ai 

Oil 
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We first explain its workings, and then compile it into a Fractran-1 program. 
Let ei,e2,e3 be the register contents of the current integer N, i.e., such that 
N = r^r^r^ 3 . In the run (= sequence of states) of the above program starting 
in ai with ei = e 2 = e 3 = 0, the subsequence of 'output' states (3 and /?i 
corresponds to the infinite word BIN. The idea is that r\ holds the current 
value n for producing the factor (n) z of BIN. State a\ with e 3 = increments ei, 
and the program proceeds in state a 2 . States a 2 and a 3 copy e\ to e 2 and we 
continue in state a^. State 0:4 subtracts 2 from e 2 while incrementing e 3 as long 
as possible (corresponding to division of r 2 by 2 and storing the quotient in 63), 
and then goes to output state /3q if the remainder e 2 7^ 0, and to output state /3i 
after decrementing e 3 , otherwise (corresponding to the definition of (-) z above). 
After any of the two output states, the program returns to state ot\. State a.\ 
with a non-zero quotient r 3 copies e% to e 2 using state a§, and then continues 
with state a^. 

We compile the above program into a flat list of fractions using the steps 
(i)-(iii) given above. For the looping states a 2 , 03, a^, and «5, we introduce 
mirror states » 2 , 03, 04, and 05. Second, we assign the following prime numbers 
to the identifiers: 

ai a 2 02 03 03 04 04 U5 »5 Po Pi Ti r 2 r 3 
2 7 11 13 17 19 23 29 31 3 5 37 41 43 



Finally, with (iii) we obtain the following Fractran-1 program: 
r 2 «5 ria 2 r 2 r 3 o 2 a 3 a 2 ric>3 0:4 a 3 



r 3 ai ai ria 2 a 2 » 2 r 3 a 3 a 3 « 3 
r 3 c>4 /3 /?i 04 r 2 c>5 0:4 as ct\ a\ 
t\ola ' r 2 a4 ' r 3 a4 ' 04 ' 7*3^5 ' "5 ' 05 ' /3q ' /3i 



which is run on input N = at = 2 to force the program to start in the initial 
state. We note that the huge number mentioned at the end of the introduction 
is the least common denominator of -Fein- 

Proposition 2.9. The word computed by Fbin is BIN. □ 



3 Productivity for Erasing PDOL Systems 

We show that the problem of deciding productivity of erasing PDOL systems 
is undecidable. The idea is to encode a given Fractran program F as a PDOL 
system Hf = (^,H,s) such that H u (s) is infinite if and only if F does not 
terminate on input 2. 

We consider Fractran programs of the form % , . . . , ^ ; every program can 
be brought into this form by taking d the least common denominator of the 
fractions. 

Definition 3.1. Let F = . . . , ^ be a Fractran program. We define the 
PDOL system U F = (T,H, s) where 

L {s, „, a, A, blM} 
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and H = (ho, . . . , hd-i) consisting of morphisms hi : T* — > T* defined for all 
i £ Ed as follows: 



hi(s 
hi(a 

MA 
hi<M 



= s J- 1 a a I J 1-1 

= £ 

_ J A J' 1 if i = d - 1 
1 e otherwise 

{ a "V(i) if ^(i) is defined 

e otherwise 

= f a 1 '^ b J^ 1 if V(i) is defined 
e otherwise 



(3) 
(4) 

(5) 

(6) 

(7) 

(8) 



Before we show that productivity of the PDOL system Hf coincides with F 
running forever on input 2, we give some intuition and an example to illustrate 
the working oIT-Lf- 

The following trivial fact is useful to state separately. 

Lemma 3.2. Let N,d,q,r £ N such that N = qd+r, and F a Fractran program. 
Then i/jf(N) = ipp(r)- If moreover b £ N divides d, then b \ N if and only if 
b\r. □ 

Let F be a Fractran program with common denominator d, and (finite or 

infinite) run Na,Ni,N2, Let $ e N and n G Ed such that Ni = qid + n, 

for all i > 0. We let x n be the 'contribution' of the iteration H n+1 , i.e., x n is 
such that H n+1 (s) = H n (s)x n . Then iJ w (s) = sx Q X!X 2 ■■■■ We will display 

( s ) in separate lines each corresponding to an x n . The computation of the 
word -ff^s) proceeds in two alternating phases: the transition from even to 
odd lines corresponds to division by d, and the transition from odd to even 
lines corresponds to multiplication by the currently applicable fraction v ' rf w ' ) . 
These phases are indicated by the use of lower- and uppercase letters, that is, 
%2n £ a, b } and X2 n +\ £ A, B}, as can be seen from the definition 
of the morphisms. Now the intuition behind the alphabet symbols (in view of 
the defining rules of the morphisms) can be described as follows. We use s as 
the starting symbol, and the symbol u is used to shift the morphism index of 
subsequent letters. 

In every even line xn 

(i) there is precisely one block of a 's; this block is positioned at morphism 
index and is of length Ni , representing the current value Ni in the run 
of F; 

(ii) b is a special marker for the end of a block of a 's, so positioned at 
morphism index n , the remainder of dividing Ni by d. 



13 



In every odd line x 2 i+i 

(iii) the number of A 's corresponds to the quotient and every occurrence 
of A is positioned at morphism index ; 

(iv) Bj (also at morphism index rj) takes care of the multiplication of the 
remainder with . Then ip(Ni) = ipfa) ensures, via Lemma [3.21 
that the morphism can select the right fraction to multiply with. 

We illustrate the encoding by means of an example. 

Example 3.3. Consider the Fractran program |, |, or equivalently 

27 10 

6 ' 6 

and its finite run 2,9,15,25. Following Definition 13.11 we construct the PDOL 
system Hp = (T, if, s) with if = (ho, • ■ • , hs) and 



hi(s) = 


s „ 5 a a b „ 5 


h (a) = 


. . . = hi( a ) = e 


Ma) = 




Km = 




h Q (A) = 


h 2 (k) = h i {k) 


MA) = 


a 10 


MA) = 


/15(A) = e 


/*>(■) = 




h2(W) = 


a 9 b J 


MB) = 


a 18 b . 5 


h 3 <m = 


a 5 b J 


hi(M) = 


/75(B) = e 



for i G Eg- Then H u (s) is finite and the stepwise computation of this fixed 
point can be displayed as follows. To ease reading, we write below each letter 
its morphism index. Let z n denote the morphism index of x n . Moreover, the 
word if" ( s ) = s xqX\ ■ ■ ■ is broken into lines in such a way that every line x n +i 
is the image of the previous line x n under H Zn (except for the line xq, which is 
the tail of the image of s under if = ffo). 

s 



xq = J' a a b u ° 

10 12 3 

xi = m~ 3 

2 3 
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x 2 = a a a a a a a a a b u 5 

0123450123 4 

x 3 = A J B J 

3 4 3 4 

x 4 = a a a a a a a a a a a a a a a b u° 

0123450123450123 4 

x 5 = A J 5 A „ 5 B J 

3 4 3 4 3 4 

x 6 = a a a a a a a a a a a a a a a a a a a a a a a a a b J 

01234501234501234501234501 2 

.t 7 = A J A J 5 A J 5 A J 5 B 

121212121 

x s — e 



Now we characterize the contribution of every iteration of H in the construc- 
tion of the word H u (2). We employ the notations given in Lemma Tl. 2 1 

Lemma 3.4. Let F = and N > 1. Let ?eN and r G Sd be such 

that N = qd + r. Let X = ^ 1 . Then we have 

ff(a jV bX) = (AX)l„' i - 1 - r (9) 

of length d(q + 1) — r. //, moreover, F(N) is defined, then 

ff r ((AX)«B„' i - 1 - r ) = a f W iX (10) 

of length F(N) + d. 

Proof. Equation © follows immediately by induction on g. To see that (|10[) 
holds for F(N)\., we note that tp(N) is defined, and so is ip(r) — ip(N), by 
Lemma 13.21 Hence we obtain 

^((AXj'B^ 1 -' ) = (a"*M ) q H r (B J- 1 -**) 

and we conclude by F(N) = N • = q ■ n^ (N) + r ■ ^f^. □ 

Lemma 3.5. For all Fractran programs F, the PDOL system Hf is productive 
if and only if F does not terminate on input 2. 

Proof. Let F and T-Lf be as in Dcfinition l3.il Let No, N\, iVa, . . . be the finite or 
infinite run of F on 2, i.e., N — F l (2), and let t G N U {oo} denote its length. 
For all i with < i < t, let S N and G be such that JV< = d + rj. 

We define x n G S* and z n G for all n > 0, as follows. Let X = , 
xq = a a b X, zo = 0, and, for n > 1, let a; n and z„ be such that iF l+1 (s) = 
H n {s)x n and z„ = |iF l (s) (mod d). Then _ff"(s) = s XX0X1X2 and the 
factor x n is at morphism index z n . With Lemma ll.2l we then have 

x n = H Zn _ 1 (x n -i) z n ee z n -i + \x n -i\ (mod d) (11) 
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for all n > 1 . Now we prove by induction on n > that 

if n = 2i < 2t, 
if n = 2i + 1 < 2t, 
if n > 2t. 

The base case is immediate. Let n > 0. If n. = 2i < 2i for some i < t, then 2Vj = 
F(iVj_i) is defined, and x n = if Zn _ 1 (x n _i) = ^^((AX)*- 1 !^- 1 ^- 1 ) = 

a Wl b X, and z„ = z„_i + \x n -i\ = r,_i + dfe-i + 1) - r,_i = (mod p), 
both by (jlip , the induction hypothesis and Lemma 13.41 If n = 2i + 1 < 2t for 
some i < t, then x„ = ff^.^^.i) = H ( a Ni b X ) = ( A X )«* B J*- 1 -^ , and 
z n = z n _i + \x n -x\ = + Ni + d = Ti (mod p), again by (|TT1) . the induction 
hypothesis and Lemma 13.41 Finally, if n = 2i, then x n — H Zn _ l (x rl -i) = 
H rt _ t {{ A X)^- 1 B „ d - 1 - r *- 1 ) = e, since F terminates on iV t _i (and so i/j(N t -x) 
and ip(rt-i) are undefined), and z„ = z n _i + |a;„_i| = rt_i+rf(gt_i + l)— rt-i 
0. Clearly, then also a;„ = e and z n = for all n > 2t. □ 

Hence, by Lemma 13.51 and Proposition 12.31 deciding productivity of PDOL 
systems is undecidable. 

Theorem 3.6. The problem of deciding on the input of a PDOL system H 
whether Ti is productive, is Ill-complete. □ 

4 Turing Completeness 

of Non-Erasing PDOL Systems 

In this section we extend the encoding of Fractran from the previous section to 
show that every computable infinite word can be embedded in the following two 
ways. 

Definition 4.1. Let S and T D S be alphabets with letters I , r £ T\E, and 
let w £ E w and u G be infinite words. We say w is prefix embedded in u if 
the following three conditions are satisfied: 

(i) for every finite prefix v of w there is an occurrence I w j in ti, 

(ii) for every occurrence of a word I v r in u with v G (r\{r})* we have that 
v is a prefix of w, and 

(iii) letters from S occur in u only in factors (subwords) of the form I v t\ with 
«G(r\{r})*. 

Definition 4.2. Let E and F D S be alphabets, and let w G S w and u G T u 
be infinite words. We say w is sparsely embedded in u if w is obtained from u 
by erasing all letters in r \ E. 



x n = a N * b X z n = 

— s z n — 
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The crucial difference with the encoding of Section [3] is that we now use 
the knowledge about the remainder not only to select the correct fraction to 
multiply with, but also to recognize when the current value is divisible by 3 or 
5, and correspondingly produce an output bit or 1, cf. Definition 12.51 The 
process again proceeds in two phases, for division and multiplication, and we 
employ lower- and uppercase letters accordingly. We introduce letters I (and L ) 
and r (and R ) marking the beginning and the end of the prefix of the infinite 
word computed by the Fractran program. Furthermore, the symbol R produces 
the output bits depending on the current remainder r*. In order to prevent that 
the output of R changes the morphism index of R , we introduce Q (and fl) 
which compensate the production of R with an inverse length. The letter ■ 
(and E ) marks the end of the line, and additionally e takes care of realignment 
after multiplication, such that the first a in each run stands on morphism index 
0. 



Definition 4.3. Let F = ^f, . . . , ^ be a Fractran program such that (without 
loss of generality) the common denominator d is divisible by 3 and 5. Define 
the PDOL system Hf = (T, H, s) with 

T= {s, o, a, A, I, L, 0, 1, r, R,e, E,0} 

and H = (ho, . . . , consisting of (non-erasing) morphisms hi : T* — > T* 

defined for every i € £ d as follows: 



hi(s) 


= s J- 1 a 


M-) 




hi(a) 


j A J- 1 


Mi) 


= B 


hi(U) 


= z 


MO 


= L 


hi{M) 


= R 


him) 


= E o d - 1 


hi(o) 


d 


MA) 


is 


MB) 


fa^ 
IE 


Ml) 





ML 



if i = d — 1 , 
otherwise. 



d~l 



if ip(i) is defined, 
otherwise. 

| if is defined, 
otherwise. 

if 3 | i or 5 | i , 
otherwise. 
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MO) 


= 




Mi) 


= 1 






( O r 


if 3 | i , 


MR) 


-{;■ 


if 5 | i , 






otherwise. 


Ml) 


=■ 




MH) 







Remark 4.4. In Definition 14.31 we require that 3 and 5 divide the common 
denominator in order for Lemma 13.21 to apply. Informally speaking, via the 
remainder we can only observe factors that also divide the common denominator. 

Remark 4.5. Let F and H be as in Definition 14.31 It can be shown that the 
symbol Q occurs in the word H 1 ^ ( s ) if and only if the Fractran program F 
halts on input 2. This fact can be used to show that it is undecidable whether 
B1 occurs in H u (s). However, we prove this differently, namely by applying 
Theorem 14.91 and using the fact that for non-terminating Fractran programs it 
is undecidable whether digit 1 occurs in the sequence computed by the program. 
See Theorem [4. 151 

Time for an example. 

Example 4.6. Consider the following Fractran program: 

2 3-7 3 5 2 
~ 7' 2~5' 2' 3' T 

which has the infinite run 2, 3, 5, 10, 21, 6, 9, 15, 25, 50, 105, 30, 63, 18, 27, 45, 
75, 125, 250, 525, 150, . . . and computes the word 

011000011000000011000000000011000000000000011 

that is, X 11 3+1 ll 6+1 ll 9+1 ll 12+1 . . . = UZo° 3 ' i+ln - Writing the 
program with the common denominator 210 yields: 

60 441 315 350 420 
~~ 210' 210' 210' 210' 210 

Let Hf be the PD0L encoding of F, as given in Definition 14.31 We consider the 
first steps of the iteration of the morphisms; for easier reading, we drop blocks 
of consecutive symbols J 210 (they do not change the morphism index of other 
letters), and let X = J 09 . 

s 

O 

x = Xa 2 bX|„ 208 I rXe 

.-lOCimtN CO i — I CM CO CN 

xi = XHXl„ 208 L R X E o 208 
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.7'2 



Xa 3 bXB„ 208 I rXe 

t-h O CO ^ CO CMtl tO 

x 3 = XBXl„ 208 L R X E o 207 

>J 51 M CN CO C*3 

xa = Xa 5 iXBJ 07 I rXe 

i-H OlOQDIO t£l ID tO ifl 

x 5 = X B X Z . 207 L R X E o 20; 

to in to m is co^LOtom a 

x 6 = X a 10 b X z J 06 I 1 r X e 



x 7 = X B X Z „ 206 L 1 R X E o 2 



206 l 1 R Y c - 200 

i-l O O rH (- CO O O i-l O rH 

x 8 = X a 21 b XlU 205 I 1 1IXH 



X B X Z „ 205 L 1 1 R X E o 



IN') 



Note that the number of a 's in rows Xn precisely models the run of the Fractran 
program 2, 3, 5, 10, 21, .... By construction, the morphism index of R equals 
the remainder r^, and consequently R produces the prefix Oil of the word 
computed by F. 

Things become more interesting if the number of a 's exceeds the denomina- 
tor 210. We look at a few steps further down the sequence: 

~ V -.250 u Vh 191 ■ hYh 

^36 — s\ d U /\ B i_i lonoooonoooooooi' /\c 

rH C C: r-t O rH ^ i i S S . J ! ! ! ! I ^ ^ " q h q 

X37 — X A X B X Z u 191 L 01100001100000001 RXE O 170 

r-i a r-l C ^ O 1 — 1 M 5!3S(.SSS3SI5SUSS Q H Q H 

^38 — X a 525 b X z „ 190 1 01100001100000001 1 r X e 

0000a) 0000 
#39 — X(AX) 2 BXZ u 190 Lonoooonoooooooil R X Ej O 105 

CO lO <D fD IO !D q Ill.gS.I.i.i 

OCO OOO oo> 

£40 = X a 150 b X z u 189 1 01100001100000001 1 r X 



In line X36 we have 250 a 's giving rise to only one A in the subsequent line 
X37. Again, A, B, Z and R stand on index 40 = 250 (mod 210), and conse- 
quently both deduce that the first applicable fraction is ||g = The letter 
A represents the quotient from the division by 210, and hence produces 441 
a 's. The letter B is responsible for the multiplication of the remainder, and 
thus produces 40 • |^ = 84 a 's. Thus we get 441 + 84 = 525 a 's in line a; 38 . 
Moreover, R produces a 1 since 250 is dividable by 5 but not 3, and B produces 
an X -block (X = J 209 ) to keep R and the remaining symbols on the correct 
index. 

Now the division of 525 by 210 has quotient 2 and remainder 105, and so 
we have two A's in line £39, and all A's, H< [Z and R standing on index 105. 
The first applicable fraction for 105 is — |, and correspondingly the two 
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A 's produce 60 a 's each, and B produces 30 = 105 • | a 's, in total giving rise 
to 150 a 's in line x^q. Now 525 is divisible by 3 and so R produces a 0. 



Til n k 



We now start working towards a proof of Theorem 14.91 Let F = 2f , . . . . ; 
be a Fractran program. We again employ the notation Hi as given in Lemma [1.2l 
Furthermore, we define relations -»^,~ C x S* by 

= {(uJv,uv) | u,v G £*} ~ = (^u->)* 

Then clearly we have if* (it) ~ Hi(v) for all t G Sd, u,v G £* with it ~ u. This 
allows us to prove properties of H"(s) reasoning modulo ~. Below, we write 
„ n with n < to denote the block J 71 with m G £<2 and n = m (mod d). For 
TV > 1 we define 

k(JV) =0 if 3 | TV 

re(AT) =1 if 3 f ^ and 5 | TV 

k(TV) = e otherwise 

Lemma 4.7. Let F = . . ,^f, and N > 1. Let k = k(JV) and X = J*" 1 . 
Let q G N and r G &e smc/i i/jai N = qd + r, and let v G { , 1 } . Then we 
have 



//|iXa v |X|j' 2 ' IwrXB) 

= X(A X) 9 B X Z j J -' 2 -\ v \ LuRX E o d ' 



(12) 



of length equivalent to —r modulo d. 

Moreover, if in addition F(N) is defined, then we have 

H r+1 {X(AX) q B X Z ^- 2 -M LvRXE o d - r ) 
~ Xa F W bXzJ- 2 > K l \ VK iXfl 

o/ length equivalent to F(N) modulo d. 



(13) 



Proof. lfl"2"|) follows immediately: Let x,y G T* be arbitrary. Then Hi(Xx) = 
XH {x), and H {a N y) = X(A X) 9 id (a r y) = X ( A X) q H r (y). Furthermore, 
ffrlXl^ 2 - 1 " 1 hrXe) = BXZ !U "-l l, l LuR7J r (B) and J? r (B) = o d ~ r . 

To show (HSJ), let F(N) be defined. Then tjj(N) = ip{r) is also defined 
(Lemma l3.2p . Hence we get, for w G T* arbitrary: 

7d. r+ i(X(AX) f 'BXw) 
= XH r ((AX) q BXw) 
= X(a"*W u X) 9 £T r (HXM;) 

= X(a n *w -XJ'a*"^ bXi/ r (w) 
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Finally, to compute H r (w) for w = ^J~ 2 ~\ v \ LuRXE o d ~ r , distinguish the 
following cases: If 3 or 5 divides r, then 3 or 5 divides also N, respectively, by 
Lemma l3.2l Hence we have k — k(N) — k(t) and | n \ = 1, and 

H r (w)=iX ji-2-M | v H r ( R X E o d - r ) 

= mX- d ~ 2 ~ lvl \ vk rXe *- r ' 
^ a j-3-\v\ | MrXe 

as required. And , if 3 { r and 5 \ r, then n = k(N) = K,(r) = e, and 
H r (w) ~ m J-z-\v\ \ vr X e, as required. □ 

Lemma 4.8. Let F be a Fractran program computing an infinite word w. Then 
the PDOL system Wf from Definition \4-3\ generates a word that prefix embeds 
w (see Definition ^. lty . 

Proof. Let F = . . . , ^ be a Fractran program, computing w £ { , 1 i.e., 
w = Wf with Wf as defined in Definition ^. 51 Let No, N%, N%, . . . be the infinite 
run of F starting on Nq = 2 (so with infinitely many N divisible by 3 or 5). 
Let qi £ N and £ be such that Ni = qid + ri. Let Hf = (I\ H, s } be the 
PDOL system defined in Definition 14.31 

We show that u = i? w (s) satisfies the conditions (i), (ii), and (hi) of Def- 
inition 14. 1| by characterizing the contribution of every iteration of H . For 
every i £ N we let Vi £ S* be defined by vq — e and Vi + \ — Vin(Ni) so 
that w — limi-i.oo Vi. For all n > let x n £ T* and z n £ be such that 
H n+1 (s) = H n (s)x n and z n = \H n {s)\ (mod d). Then H u (s) = sx a x! ■ ■ ■ , 
and 

X n = -Hzn-iCZn-l) Zn = Zn-1 + \%n-l\ (mod d) (14) 

Let us abbreviate Yi = u d ~ 2 -KI . We prove that x n and z n satisfy 

x 2l ~ Xa* b XBV, Ir, r X e 
x 2l+1 ~ X ( A X ) 9 * B X Z Y, L Vi R X E o d ~^ 

Z2i = 1 (mod d) 
Z2i+i = n + 1 (mod d) 

by induction on n. 

For the base case, we see z$ = | s | = 1 and xq = XaabXz „ d ~ 2 I r X e , as 
required. So let n > 1. If n = 2i for some i > 1, it follows from (|T4"|). Lemma |4~71 
and the induction hypothesis that 

Xn — H Zn _ 1 (x n —±J 

~ fl- ri _ 1+ i(X( A X)"- 1 B XR.ifc.JXlo^-i ) 
~Xa w -b XH^IuiBXB 
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and z n = z„_i + |x„_i| = r^i + 1 - r, ; _i = 1 (mod d). 
Similarly, if n = 2i + 1 for some i > 0, we obtain 

~ffi(Xa JVi bXHFJ^rXe) 
~Xa%' bX|F 1+1 l %1 rXe 

and z„ = z n _i + |a?„_i | = 1 + iVj+i = n + 1 (mod d). 

Knowing the exact shape (modulo ~) of u = H u (s), it is now easy to verify 
that m it satisfies (i), (ii), and (iii), taking into account that ^ does not occur in 
any factor I v r of u with v G (T\ { r })*, by the definition of the morphisms. □ 

We are ready to collect our main results. 

Theorem 4.9. Every computable infinite word can be prefix embedded in a 
PDOL word (see Definition ^. 

Proof. Let w G {0,1}" be an infinite computable word. Then, by Proposi- 
tion 12. 71 w is computed by some Fractran program. By Lemma 14.81 the claim 
follows. □ 

Definition 4.10. Let F be a Fractran program, and Hp the PDOL system 
given in Definition ^. 31 We define the PDOL system T-L' F as the result of replacing 
in T-Lf the rules hi(Q) — and hi ( 1 ) = 1 by hi ( ) = ^ and hi ( 1 ) = ^ , for 
all t G Ed- 
Lemma 4.11. Let F be a Fractran program computing an infinite word w, and 
let u G Y u be the PDOL word generated by the W F defined in Definition ^. 1U\ 
Then w is sparsely embedded in u. 

Proof. By an easy adaptation of the proof of Lemma 14.81 noting that every 
output and 1 is produced precisely once and in the next iteration replaced 
by „. □ 

Theorem 4.12. Every computable infinite word can be sparsely embedded in a 
PDOL word (see Definition\4-.2fy. 



Proof. Analogous to the proof of Theorem l4.9[ replacing Lemma l4.8l bv Lemma 

on □ 

It is known that the set of morphic words is closed under finite state trans- 
ductions [U Theorem 7.9.1]. In particular, if we erase all occurrences of a certain 
letter from a morphic word, the result is a morphic (or finite) word. From The- 
orem 14.121 it follows that this is not the case for PDOL words, establishing a 
negative answer to Problem 29 (1) and (2) of 23 . 

Corollary 4.13. The set of PDOL words is not closed under finite state trans- 
ductions. 
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Proof. There are computable streams that are not PDOL words [TU]; hence the 
class of PDOL words is not closed under finite state transductions, by Theo- 
rem 14.121 (erasing letters is a finite state transduction) . □ 

Finite state transducers play a central role in computer science. The trans- 
ducibility relation via finite state transducers (FST) gives rise to a hierarchy 
of degrees of infinite words [17] , analogous to the recursion theoretic hierarchy. 
But, in contrast to the latter, the FST-hierarchy does not identify all computable 
streams. An open problem in this area is the lack of methods for discriminating 
infinite words it, v, that is, to show that there exists no finite state transducer 
that transduces it to v. Discriminating morphic words seems to require heavier 
machinery than arguments based on the pumping lemma. 

We will now collect several immediate consequences of Theorem 14.91 First 
of all, we have solved the open problem 22 on the existence of PDOL words 
that have exponential subword complexity. 

Theorem 4.14. There is a PDOL word u such that p u (n) > 2™. 

Proof. Let F = -Fein be the Fractran program defined in Section [2] computing 
the word W F = BIN (Proposition [23]). Furthermore, let U F = (T,H, s) be the 
PDOL system of Definition 14.31 Then, by Lemma 14. 8[ it = H u ( s ) is the word 
we are looking for. □ 

Lemma I4T51 also allows us to give a negative answer to [331 Problem 29 (3)]. 

Theorem 4.15. The following problems are undecidable: 
Input: PDOL system H = (T, H, s), letter beT 

Question: (i) Does b occur in H u (s)? 

(ii) Does b occur infinitely many times in H LJ (s)? 

Proof. We show that the following problem is undecidable: given a Fractran 
program F computing an infinite word w over the alphabet {0,1}, does the 
letter 1 occur in w? This suffices since by Lemma \A. 81 if u is the infinite word 
generated by %f, then the letter 1 occurs in it if and only if 1 occurs infinitely 
often in it if and only if 1 occurs in w. 

We use the input 2 halting problem for Fractran programs which is Ej 1 - 
complete by Proposition 12.31 Let F be an arbitrary Fractran program. By 
Remark 12.41 we can replace the primes in F to obtain a program F' that does 
not contain the primes {2, 3, 5} such that F' halts on 7 if and only if F halts on 
2. We now extend F' to F" by adding in front the fraction ^ and at the end 
the fractions | and j. Then the first fraction of F" starts F 1 on input 7 and 
ensures that the output is for every step that F' is running, and only when F' 
terminates, the last two fractions of F" switch the output to 1 and keep running 
forever. □ 
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From Theorem l4.15l it follows immediately that the first-order (and monadic 
second-order) theory of PDOL words is undecidable, answering [23l Problem 28] ; 
see [53] also for the definition of the first-order and monadic theory of a sequence. 
This again stands in contrast to the case for morphic sequences, which are known 
to have a decidable monadic second-order theory [SJ. 

Corollary 4.16. The first-order theory of PDOL words is undecidable. 

Also immediate from Theorem 14.151 is the undecidability of equivalence of 
PDOL systems (equality of the limit words they generate) . We note that equiv- 
alence of DOL systems is decidable [9] , whereas that of CDOL words is an open 
problem. 

Corollary 4.17. Equality of PDOL words (given by their PDOL systems) is 
undecidable. 

Proof. We reduce problem (i) stated in Theorem 14.151 to equivalence of PDOL 
systems, as follows. Let H = (£, H, s) be a PDOL system and b £ S, and let 
W = (S U {&'}, H', s') where b' <£ £ and H' and s' are obtained from H and s 
by replacing all occurrences of b by b', and letting H'(b) — b. Then b does not 
occur in the word generated by % if and only if % and %' generate the same 
word. By Theorem 14. 151 this is undecidable. □ 

5 A Concrete PDOL Word 

with Exponential Subword Complexity 

In this section we give a concrete example of a PDOL system which generates 
an infinite word with exponential subword complexity. The word embeds all 
prefixes of the word BIN = (0) z (l) z (2) z • • • given in Definition 12.81 We refrain 
from proving that it indeed does have this property; the existence of such a 
PDOL word is already proved in the previous section, see Theorem 14.91 

We define a PDOL system H — (ho, hi, . . . , h\§) consisting of 16 morphisms. 
We express morphism indices i £ Ei6 by linear combinations 

i = a(i) • 2 3 + r(i) ■ 2 2 + c(i) • 2 1 + o(i) ■ 2°. 

with a(i),r(i),c(i),o(i) £ {0,1} which we call flags. We use these flags to 
transmit information between symbols: 

• a(i) = 1 stands for active, 

• r(i) = 1 stands for running, 

• c(i) = 1 stands for carry flag, and 

• o(i) = 1 stands for output one. 
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The idea is to simulate a binary counter, using the representation of Dcfini- 
tion !2.8l The counter repeatedly increments (+1) the current value, and thereby 
brings (n) z to (n+l) z . During an increment process we need to shift the activity 
from bit to bit. To this end, the activity flag a(i) indicates whether a symbol 
at morphism index i is active. 

We explain the increment process using the following example word. Here 
... is the already produced prefix of BIN, and we assume for the moment that 
the symbols a , b and d each stand for a word of length 16, and B for a word 
of length 8. 



Here a and b represent the bits and 1 , respectively, and we shall con- 
tinue to call them bits. Ignoring the H's in between, (fT5j) represents the word 
0110 (in turn representing the integer 21). Apart from incrementing this initial 
word a b b a , it is at the same time 'copied' bit by bit to the word 0110 between 
symbols L and R. The least significant bit is left, and consequently the process 
of incrementing will proceed from left to right. The symbol c (being of length 
8) swaps the value of a(i) for the morphism index i of all subsequent letters. 
Note that between the n-th and (n + l)-th occurrence of bits (a or b), there 
are 2 n ~ 1 >'s. Hence, if the first bit is active, then this is the only active bit. 

We now describe the transition from (fT5"|) to its PDOL image (fl6| . Note that 
starting from the first occurrence of ■, every second occurrence in (|15[) has the 
activity flag set. When the symbol c is active, it will be eliminated, that is 
replaced by the symbol d (of assumed length 16), thus activating the next bit 
for the next iteration (JTGj) . 



Note that a is replaced by B ; uppercase letters are used for indicating the 
already processed bits during the increment loop. When the increment loop is 
finished, uppercase will be turned to lowercase, and the process restarts. The 
switch from a to B corresponds to incrementing. This is controlled by the 
carry flag indicating whether a bit has to be flipped. The carry flag is always 
set at the start of an increment loop. To keep this example simple we do not 
display this flag. At the end of this section we give the first iterations of the 
PDOL system displaying all flags. 

Notice that in (fT6j) the second bit b is the only active bit (ignoring B which 
we have already dealt with). Again, eliminating the active H's will shift the 
activity to the following bit: 



acbccbccccaL...R 



(15) 



Bdbcdbcdcdal OR 



(16) 



BdBddbcdddaL...01R 



(17) 



After one more step we obtain: 



BdBddBddddaL...011R 



(18) 
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and finally: 



BdBddBddddAL...0110R (19) 

As soon as the most significant bit a is active, R becomes active as well. This 
can be used to recognize when the addition is finished, and then R unsets the 
bit r(i) to restart the addition procedure. 

The active bit makes use of the flag o(i) to 'communicate' with the sym- 
bol R whether to output a or 1 . This actually means that R can produce 
the or 1 only two iterations later; for simplicity we have in this intuitive 
explanation abstracted from this technicality and produce the 's and 1 's in 
the immediately following iteration (after a bit has become active). 

There are more symbols and technical subtleties to be explained, but we 
leave this to the imagination of the reader. Enjoy! 

The morphisms hi are defined for all i G Si6 as follows: 

h(s) = s J 3 . a J 5 u JM~ 15 m~ 15 M- u ^M~ 

M-) = - 



2 

16 



hi(o) 

hi(-k) = J o 
hi(0) = 
hi(l)= 1 

hi(a) 
MA) 
hi(b) 
MB) 

htm 

hi(d) 







if 


14 


★ A 


if 


14 


* J 2 . 2 B 


if 


a 


if — ir (i) 






if r(i) 




' b 


if ->a(i) 




B 


if a(i) A -ic( 


:o 


A 


if a(i) A c(i 




b 


if -ir(i) 




B 


if r(z) 






if -ia(i) 




d J 


5 ifo(i) 




uJ 


; if ->r(i) 




d 


if r(i) 





2G 





f P 


if — ic(i) 


MP) = < 


p 


if c(i) A -io(i) 




u 


if c(i) A o(i) 




[■ 


if — ic(i) 


hi(0)= < 




if c(i) A -io(i) 




; 


if c(i) A o(i) 






if — ia(i) 


MO) = < 




15 if a(i) A 




; : : 


15 P „ 15 if a(i) A 






if ->r(i) 


Ml) = < 




15 if r(i) A -ia(j) 




M 


J 5 if r(i) A a{i) 



MB 

''/(■) 
MH) 

ML) 



MR) 



MRi! 
fti(R 2 ; 



3 (0 



L 



R 





R 3 





R 3 




1 


R 3 


1 


R 3 



if — ir (i) 

if r(i) A -<o(i) A -ia(i) 

if r(i) A -io(i) A a(i) A -ic(i) 

if r(i) A -*>(£) A a(i) A c(i) 

if r(i) A o(i) A -io(i) 

if r(i) A o(i) A a(i) A ->c(i) 

if r(i) A o(i) A a(i) A c(i) 



R 

Ri 
R 2 



The PDOL word H"(s) starts as follows: 

sJ 3 ma J 5 cJP J 5 J 5 1 J 4 L Ri J 



14 15 15 



, 29 * J 2 . 2 B J 5 d J 5 P J 5 „ 15 H„ 14 LR„ S 

13 14 10 12 13 12 13 12 13 12 13 12 13 11 12 13 



J 9 ( J o ) g J5 d J5 p Q J5 „15 



. 2!) L R 2 

6 3 4 5 



27 



^4 Q 10 71 g 15 j 15 p 15 Q 15 Q l~>^g 29 

14 2 12 3434343434 3 41234 



(J 2 o) 4 J23i b J 5 H~ 23 P ~ 15 o J 5 U 15 M- 29 L Ri 

12 S 15 15 0787878 7 8567 



For compactness, we continue without displaying the symbols The length n 
of blocks J 1 matters only modulo 16, and can be deduced from the morphism 
indexes of the surrounding letters. 

AdPoOZLOR 

15 15 15 15 15 15 13 14 15 



A d a d d P Zs L 1 R3 

7777777 7 456 7 

★ 4 o 8 A d a d d P Z 2 L 1 R2 

4 3333333 3 012 3 

o 4 acaccPOZiL0lRi 

14 15 15 7 7 15 7 7 7 4 5 6 7 

*« 2 BdacdPOZL01R 

13 10 12 12 12 12 4 4 4 4 1 2 3 4 



oBd*AddPOZL010R 

12 13 13 11 12 12 12 12 12 12 8 9 10 11 12 



BdoAddPoOZaLOlOORa 

444555555 5 01234 5 



★ 4 o 10 B d A d d P o Z 2 L 1 R2 

14 2 33333333 3 14 15 012 3 



o 4 bcaccPoOZiL0100Ri 

14 15 15 77 15 77 7 7 23456 7 



AdacdPoOZLOlOOR 

15 15 15 15 7777 7234567 



2N 



Ad* 


• 2 BddPoOZ 


L01 


1 R 


15 15 13 


10 12 12 12 12 12 12 12 


6 7 8 


9 10 11 12 


Ado 


BddPooOZ 3 


L 1 


1 I 


AAA 




14 15 


12 3 4 



R 3 



* 4 o 10 A d B d d P o o Z 2 L 1 1 R2 

14 2 333333333 3 12 13 14 15 012 3 




o 4 acbccPooOZiL010010Ri 



14 15 15 7 7 15 7 7 7 7 




0123456 7 


*. 2 BdbcdPoo 





Z L 1 1 R 


13 10 12 12 12 12 4 4 4 4 


A 


4 13 14 15 1 2 3 4 


oBdBddPooO 


Z 


L 1 1 R 


12 13 13 13 13 13 13 13 13 13 


13 


5 6 7 8 9 10 11 12 13 


BdBddPoooO 


z 3 


L 1 1 1 R3 


555555555 5 




12 13 14 15 1 2 3 4 5 









★ 4 o 10 BdBddPoooO Z2LOIOOIOOIR2 

14 2 3 3 3 3 3 3 3 3 3 3 3 10 11 12 13 14 15 1 2 3 




o 4 b c b c c P o o o Zi L 1 1 1 Ri 

14 15 15 77 15 7 777 7 7 14 15 0123456 7 



AdbcdPoooOZLOlOOlOOIR 

15 15 15 15 777777 7 14 15 01234567 



A d A d d P 





OZL010010011R 


15 15 15 15 15 15 


15 15 15 


15 15 5 6 7 8 9 10 11 12 13 14 15 


Ad Ad d a 


d d d 


dPOZ 3 L010010011lR3 


7 7 7 7 7 7 




77 7 7 12 13 14 15 0123456 7 


+ 4 o 8 AdA 


d d a 


ddddPOZ 2 L010010011lR2 


4 3 3 3 


3 3 3 


3 3 3 3 3 3 3 8 9 10 1 1 12 13 14 15 1 2 3 
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o 4 acaccaccccPOZiL010010011lRi 

14 15 15 7 7 15 7 7 15 7 15 7 7 7 12 13 14 15 1 2 3 4 5 6 7 



*« 2 BdacdacdcdPOZL0100100111R 



13 10 12 12 12 12 4 4 4 12 12 4 4 



4 9 10 11 12 13 14 15 1 2 3 4 



oBd*AddacdddPOZL01001001110R 

12 13 13 11 12 12 12 12 12 4 4 4 4 4 4 S 9 10 11 12 13 14 15 1 2 3 4 



BdoAdd*AddddPOZL010010011100R 

12 12 12 13 13 13 11 12 12 12 12 12 12 12 12 15 1 2 3 4 5 6 7 8 9 10 11 12 



6 Discussion 



In Section [5] we have encoded the state of a binary counter using a binary en- 
coding. In comparison with the binary counter obtained from the Fractran 
encoding, this yields an enormous simplification concerning the number of re- 
quired morphisms. Moreover, we have illustrated a construction which allows 
for shifting the activity from one letter to the next in each iteration of the 
morphism, and how the letters can 'communicate' computation results to the 
following letter. It would be interesting to investigate whether Turing machines 
can be encoded in a similar way. The crucial difference would be that for Turing 
machines we need to shift the activity left or right depending on the outcome 
of the current step; the binary counter always shifts the activity to the right. 
It is unclear to us whether the encoding from Section [5] can be extended in this 
direction. Compared to our Fractran encoding of Section [4j such an encoding of 
Turing machines could lead to significantly less morphisms (but with a slightly 
larger alphabet). 
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